TECHDOC: Multilingual generation of online and offline instructional text
Authors
Abstract
Project idea: Supplying the technical documentation that accompanies a product in multiple languages is a growing problem, particularly in Europe with its legislation on the common market. A huge amount of translation work is needed when technical documentation is created and updated. In response to these needs, the TECHDOC project aims at supporting the creation and maintenance of technical documentation by knowledge-based, multilingual generation. The idea is to build up a knowledge base that includes a model of the product in question, and to produce documentation in multiple languages automatically. At present, the system produces maintenance instructions in English, German, and French. The application domain of TECHDOC is technical manuals, which means dealing with "real-world" texts: the domain is practical rather than a "toy world". At the same time, the language used in such manuals tends to be relatively simple; one mostly finds straightforward instructions written to be readily understood by a person who is carrying out some maintenance activity. Moreover, the structure of manual sections is largely uniform and amenable to formalization.
Similar resources
Intentions, Rhetoric, Or Discourse Relations? A Case From Multilingual Document Generation
The TECHDOC system [Rösner, Stede 92b] is an implemented prototype that starts from a domain knowledge base about maintenance plans, the objects and actions involved, potential hazards, etc., and delivers simultaneously generated instruction texts in the three currently supported languages (English, German and French). Our approach is best characterized as an attempt to demonstrate the feasibility o...
Extraction of linguistic resources from multilingual corpora and their exploitation
Increasing availability of on-line and off-line multilingual resources along with the developments in the related automatic tools that can process this information, such as GIZA++ (Och & Ney 2003), has made it possible to build new multilingual resources that can be used for NLP/IR tasks. Lexicon generation is one such task, which if done by hand is quite expensive with human and capital costs ...
Multilingual Drafting of Instructional Texts
This paper presents a new approach to computer assisted production of parallel instructions in multiple languages: multilingual generation. At the IT Research Institute at the University of Brighton, we are working on two major projects on multilingual generation of instructions, funded by the CEC and SERC/DTI and involving collaboration with groups of professional technical authors and transla...
Welkin: Automatic Generation of Adaptive Hypermedia Sites with NLP Techniques
The demonstration shows the system Welkin, a multilingual system that analyses one or several source texts with a cascade of linguistic-processing modules, including syntactic analyses, text identification and text classification techniques, in order to fill in a database of information about it. That information is later used to generate on-the-fly adaptive on-line information sites according t...
Generating Multilingual Documents from a Knowledge Base: The TECHDOC Project
TECHDOC is an implemented system demonstrating the feasibility of generating multilingual technical documents on the basis of a language-independent knowledge base. Its application domain is user and maintenance instructions, which are produced from underlying plan structures representing the activities, the participating objects with their properties, relations, and so on. This paper gives a b...